Computationally Efficient Body-Conducted Voice Conversion with Original Excitation Signals
نویسندگان
چکیده
In this paper, we propose a computationally efficient method of body-conducted voice conversion. A body-conducted voice is robust against to external noise but its voice quality is severely degraded by mechanisms of body-conduction. The conventional body-conducted voice conversion method effectively enhances the body-conducted voice by converting both spectral and excitation features. On the other hand, its computational cost is relatively high. To significantly reduce the computational cost while keeping the enhanced voice quality as high as possible, we propose a conversion method of using an original excitation signal of the body-conducted voice and computationally efficient feature extraction. The effectiveness of the proposed method is confirmed in the objective and subjective evaluations.
منابع مشابه
Direct F0 control of an electrolarynx based on statistical excitation feature prediction and its evaluation through simulation
An electrolarynx is a device that artificially generates excitation sounds to enable laryngectomees to produce electrolaryngeal (EL) speech. Although proficient laryngectomees can produce quite intelligible EL speech, it sounds very unnatural due to the mechanical excitation produced by the device. To address this issue, we have proposed several EL speech enhancement methods using statistical v...
متن کاملImplementation of Computationally Efficient Real-Time Voice Conversion
This paper presents an implementation of real-time processing of statistical voice conversion (VC) based on Gaussian mixture models (GMMs). To develop VC applications for enhancing our human-to-human speech communication, it is essential to implement real-time conversion processing. Moreover, it is useful to reduce computational complexity of the conversion processing for making VC applications...
متن کاملMon.O1d.05 Implementation of Computationally Efficient Real-Time Voice Conversion
This paper presents an implementation of real-time processing of statistical voice conversion (VC) based on Gaussian mixture models (GMMs). To develop VC applications for enhancing our human-to-human speech communication, it is essential to implement real-time conversion processing. Moreover, it is useful to reduce computational complexity of the conversion processing for making VC applications...
متن کاملOn the limitations of voice conversion techniques in emotion identification tasks
The growing interest in emotional speech synthesis urges effective emotion conversion techniques to be explored. This paper estimates the relevance of three speech components (spectral envelope, residual excitation and prosody) for synthesizing identifiable emotional speech, in order to be able to customize voice conversion techniques to the specific characteristics of each emotion. The analysi...
متن کاملNew algorithm for LPC residual estimation from LSF vectors for a voice conversion system
Voice conversion involves transforming segments of speech from a source speaker to make them to be perceived as if spoken by a target speaker. Generally, this process involves the estimation of vocal tract parameters and an excitation signal that match the target speaker. The work presented here proposes an algorithm for estimating the excitation residuals of the target speaker using a weighted...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011